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Abstract. This paper analyses stable commutator length in groups Z r * Z s . 

We bound scl from above in terms of the reduced wordlength (sharply in 
the limit) and from below in terms of the answer to an associated subset-sum 
type problem. Combining both estimates, we prove that, as m tends to infinity, 
words of reduced length m generically have scl arbitrarily close to — • 1. 

We then show that, unless P = NP, there is no polynomial time algorithm 
to compute scl of efficiently encoded words in Fg. 

All these results are obtained by exploiting the fundamental connection 
between scl and the geometry of certain rational polyhedra. Their extremal 
rays have been classified concisely and completely. However, we prove that a 
similar classification for extremal points is impossible in a very strong sense. 



1. Introduction 

Stable commutator length (hereafter scl) is a concept in geometric group theory 
which arises naturally in the study of least genus problems such as: 

Given a topological space X and a loop 7, what is the least genus 
of a once-punctured, orientable surface which can be mapped to X 
such that the boundary wraps once around 7? 

It transpires that the real-valued function scl gives an algebraic analogue of the 
(relative) Gromov- Thurston norm in topology and has deep connections to various 
areas of interest in modern geometry (see [3]). The computation of scl is notoriously 
difficult and its distribution often mysterious, even in free groups. Important open 
problems in the theory of scl in such groups are to determine the image of scl 
("inverse-problem"), and, more ambitiously, to find a clear relation between the 
outer form of a word and its scl ( "form-problem" ) . 

An a priori completely unrelated concept ubiquitous in the theory of linear 
optimization is that of a (convex) polyhedron and its boundary, the sail. If such a 
polyhedron P is pointed (i.e. does not contain any line), it has a particularly simple 
ray- vertex-decomposition as P = conc(i?) + conv(F), where R and V are the finite 
sets of extremal rays and points respectively (see Chapter 8 of [I]). Combinatorial 
optimization is often concerned with polyhedra whose elements represent flows, and 
which are given to us as the convex hulls of combinatorially distinguished flows (e.g. 
paths from source to sink, see Chapter 13 in [6]). In such cases, the description of 
V and R is a crucial step towards a complete understanding of the geometry of P. 

These two concepts were bridged by Calegari's algorithm (see [1]), which es- 
tablishes an intricate connection between the computation of scl in groups of the 
form *™ 1 17 1li and the geometry of certain rational flow-polyhedra. The sails of 
these polyhedra are the unit sets of one-homogeneous functions, which one has to 
maximize over certain subsets in order to compute scl. 
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There are two ways in which this link can be exploited: the relative approach 
compares the polyhedra corresponding to different words and converts geometric 
relations between them into numerical ones relating their scls. In contrast, the 
absolute approach uses the precise, very involved geometry of individual polyhedra 
to compute the scl of given words exactly. The former technique is significantly more 
accessible as it does not require such a detailed analysis. Amongst other things, 
it has been used to prove the salient Surgery Theorem (see Theorem 4.13 in [4]), 
which demonstrates that the scl of certain natural sequences of words converges. 

Following this method, we start off the first section of this paper by observing 
that certain linear-algebraic relations between exponents of words translate directly 
into inequalities of scl and then use this to relate the scl-images of different groups 
of the form Z r * Z s . More importantly, we combine both of the aforementioned 
approaches to obtain new upper and lower bounds and use these to prove that the 
scl of a generic word of reduced length m is close to ? - 1. 

Our lower bound implies that to prove the long-standing open conjecture that 
scl(Z * Z) D Q fl [1, oo), we can restrict our attention to a certain subclass of words. 

Our second main theorem shows that computing scl is hard: unless P = NP, the 
scl of a word cannot be determined in polynomial time. 

The second approach is more formidable, but promises more substantial progress 
towards a complete solution of the two guiding problems mentioned initially. An 
exhaustive analysis of the polyhedral geometry has been carried out in a few specific 
cases (see section 4.1 in [4), allowing the explicit computation of scl in several 
infinite families of words. These partial successes raised the hope that a complete 
description of the polyhedra was within reach. Indeed, the first half of their ray- 
vertex-decomposition was found by Calegari who provided a general and simple 
classification of their extremal rays (Lemma 4.11 of [4], see page 4). 

The main result of the second section of this paper demonstrates that the next 
step cannot be made: a similar classification for extremal points is impossible, 
roughly speaking because they exhibit provably arbitrarily complicated behaviour. 
We conclude the paper by showing that a natural alternative description of the 
relevant polyhedra is infeasible from a complexity-theoretic perspective. 

1.1. Main Results. We first use polyhedra to prove positive theorems on scl, and 
then we provide negative results explaining why certain nice descriptions of these 
polyhedra cannot exist. All words are assumed to lie in the commutator subgroup 
of Z°° * zHE to start in the left and to end in the right factor. This particular 
case comprises all words in all groups Z r * Z s . A word has reduced length m if 
it switches m times from one to the other factor of our free product. This notion 
generalises to all free products, and it differs from the classical wordlength, which 
counts the number of letters in a word. For the words we examine, m = 2n is even. 

In Section [21 we define stable commutator length (|2.1[) . and then give a detailed 
description of Calegari's algorithm, thereby introducing relevant terminology (|2.2p . 

In Section [3j we prove bounds on scl and the complexity of its computation. 
Words of reduced length m — 2n in Z°° * Z°° are most naturally expressed as: 

4>{x,y) =a Xl -b yi • ... a x " ■ b y " 



Here Z°° is free abelian group on countably many generators, which we denote by {ai}igpj 
in the left factor and by {feijigN in the right factor. For x G Z°° with components we WTite 
a x = af 1 ' ■ a| ( ' • a similar expression defines b x . 
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for x — {xj}, y = {yj} certain collections of nonzero vectors. 

We start by proving that values in the set scl(Z°° * Z°°)\scl(Z r * Z s ) cannot 
come from words that are "too short" : 

Compactness Lemma. If v € Z°° * Z°° has reduced length N, its scl is already 
contained in the image scl(Z r * If) for all r, s > N . 

More importantly, we give a lower bound on scl depending on the number of 
exponents we need to represent zero as a nontrivial sum (repetitions allowed). 

Lower Bound Theorem. Let w — <fi(x,y) G Z°° * Z°° have reduced length 2n. 
Fix p, q G N, and assume that the following two implications hold: 
If (Xj)j G N™\{0} is a vector with J2j ^o x< f^ ~ f or a ^ h then ^ . -\j > P- 
If (pj)j G N™\{0} is a vector with J^j P^jVj = f or a ^ h then J^j > Q- 
In this case, we have the inequality scl(u>) > ^(1 — i — |) 

For each length, intersecting the polyhedra of all words of this length yields an 
upper bound on scl which is "best possible in the limit" : 

Upper Bound Theorem. Write C(m) for the supremum of the scl of words in 
Z°° * Z°° of reduced length m = In > 4. Then this supremum is attained and 
satisfies 

{f — 1) if n odd 

n (n-l)!-l - f 
2 - n(n-2)!-2 ' ^ ^ CVCn 

Moreover, given e > 0, we have for m sufficiently large: ^ — 1 < C(m) < ^ — 1 + e 

To state our main result precisely, we need to define what we mean by a "generic 
property". Recall the map <p from above, which associates a word of reduced length 
m = 2n to pairs (x, y) of certain collections of vectors in the rank-(n — l)-module 

Definition 1.1. Let P be a property on words in the commutator of Z°° * Z°°. 
We say words of reduced length m = 2n generically satisfy P if there are finitely 
many submodules W±, Wi of V of ranks at most n — 2, such that whenever not 
all x^ 1 ' and not all y( % > lie in |J fe Wk, the property P(<f>{x, y)) holds. 

Welding the upper and the lower bound together, we conclude: 

Generic Word Theorem. Given any e > 0, we can choose N such that for all 
m > N , words w of reduced length m generically satisfy 



scl(w) G 



m m 

— 1, — 1 + e 

4 4 



For efficiently encoded words in F2 , all known scl algorithms are computationally 
expensive. Here we elucidate that such expenditure arises not through the fault of 
the algorithms but from the intrinsic difficulty of the determination of scl: 

Complexity Theorem. Unless P = NP, the scl of words <fi(x,y) G F2 cannot be 
computed in polynomial time in the input size of the vector (x,y). 

In Section IU we analyse the geometry of the relevant flow-poly hedra. In order to 
express the main result precisely, we need to clarify what we mean by the "abstract 
graph underlying a flow" . 
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Definition 1.2. Take the smallest equivalence relation on the class of (finite) multi- 
digraphs ( "MD-graphs" ) which is stable under subdivision of directed edges. 

An MD-graph is called abstract if it does not contain subdivided edges. Note 
that every MD-graph is equivalent to a unique abstract graph. 

Figure 1: The abstract graph of an MD-graph 

The underlying abstract graph of a flow is the abstract graph of its support^ 

The general classification of extremal rays obtained by Calegari (see Lemma 4.11 
in [3]) implies that the abstract graphs underlying extremal rays are of an elegant 
simplicity - they are all isomorphic to one of the following three MD-graphs: 

G GO <£> 

Figure 2: The three abstract graphs underlying extremal rays 

However, we prove that a classification of the extremal points which gives rise 
to any nontrivial restriction on the underlying abstract graphs cannot exist: 

Non-Classifiability Theorem. For every connected, nonempty, abstract MD- 
graph G, there is an (alternating) word w G Z * Z and an extremal point f of a 
flow-polyhedron associated to w such that G is the abstract graph underlying f . 

The polyhedra in Calegari's algorithm arise as P = conv(Z? + V), where V is the 
(understood) recession cone, and D is an infinite integral subset of V. The aim is 
to find an efficient representation, and a very natural alternative to the vertex-ray- 
decomposition is the essential decomposition: here, we use the minimal set T C D 
{essential vectors) with T + V = D + V to encode P. Our final theorem indicates 
that this decomposition is computationally infeasible: 

Essential Membership Theorem. The decision problem "Given a word w G F 2 
and a vector v in the corresponding cone, is v essential?" is coNP— complete. 

2. Background 

We give a review of some basic properties of scl and relevant previous work. 

2.0.1. Word-parametrisation <j>. First, we introduce effective notation for words in 
the group Z°° * Z°°, which is the fundamental group of the wedge of two spaces. 
Without losing generality for our purposes, we will assume that all mentioned words 
are elements of the commutator subgroup of G = A * B = Z°° * Z°° which start 
in A and end in B. The following is the central invariant of words in our group; it 
measures how often a loop switches from one space to another: 




'The support of a flow is the digraph induced by the edges with nonzero flow. 
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Definition 2.1. Every word w can be written as w — u\Vi...u n v n with Ui £ A\{1} 
and Vi £ £>\{1}. We define the reduced wordlength (or, more concisely, reduced 
length) of w to be 2n. 

We will examine the map cf> introduced in 11.11 in more detail. Let Uk (k £ N) be 
copies of the space U — {z £ Z n \ £V Zj = 0} and define 

M n ={{zW,zW,...)emU k Vj e{l,...,n} Bi£N:zf ) ^o\ 




For z £ M n , we write zj = (zj , ...). The map <f> from 1 1.1 1 then gives a bijection 
between M n x M n and words of reduced length 2n, which is given explicitly by: 

0(jb,v) = I < a 2 x ... • I bl 1 bl 1 ... j ■ ... ■ (a x " a 2 " ...J ■ [b\ n bl n ...J 

For a loop 7 suitably representing <f>{x,y), the integer at^ can be geometrically 
interpreted as the number of times our loop walks along the i th generator in the 
left space between the (2j — l) th and (2j) th passage through *. A shifted statement 

(i) 

holds for y - and the right space. 

2.1. Definition of Stable Commutator Length. We give a very accessible, 
algebraic definition and sketch an equivalent, more motivated topological one. 

Definition 2.2. Let G be a group and g £ [G,G]. The commutator length cl(g) is 
defined to be the least number of commutators in G whose product is g. 

We stabilise this definition and define the stable commutator length of g to be: 

u \ r cl (5") 
scl((?) = lim 

n— >oo Ji 

There is a close link between commutator length and the least-genus problem 
mentioned initially, which yields a purely topological definition of cl: 

Proposition 2.3. Let (X,x) be a pointed topological space with fundamental group 
G = ni(X, x). Assume moreover that 7 is a based loop with homotopy class g. Then 
cl(g) is the least genus of a once-punctured, orientable, compact, and connected 
surface S which can be mapped to X such that OS wraps once around 7. This 
follows directly from well-known classification of compact surfaces. 

This result can be extended to obtain a similar topological definition of scl. It 
describes scl as a measure of how simple a surface (rationally) bounding a given 
loop can be, where the meaning of "simple" is slightly tweaked: 

A map / : S — > X from a compact orientable surface S is called admissible for 
a loop 7 if / wraps the boundaries of S around 7. To such a map, we associate the 
quotient ^ J ^ m '°^ S: '^ ^ | where x(5 , J ) is the Euler characteristic of the connected 
components Sj of S, and n is the degree with which / wraps dS around 7. Then 
scl ([7]) is given by the infimum of this quotient over all admissible maps /. 

Moreover, scl can be extended to homologically trivial chains on our group G. 
One can use this to continuously extend scl to the group B\(G) of 1— boundaries in 
the real group homology of G. In many relevant cases, this extension even descends 
to a norm on a suitable quotient of B\(G). 

However, the precise formulation of both of these definitions requires more tech- 
nical care and we therefore refer the reader to the sections 2.1 and 2.6 in [3]. We 
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also recommend section 2.4, which establishes a close connection between scl and 
bounded cohomology. 

2.2. Calegari's Algorithm. This algorithm enables the computation of scl in 
free products of free abelian groups. For the sake of notational convenience, we 
will restrict ourselves to the specific case of two factors and to words rather than 
chains. Our group is then the fundamental group of the wedge X of two tori, hence 
we can represent its elements by loops in these spaces. 




Figure 3: Loop representing w — aib^a-^ 1 b 1 1 b 2 1 € Z * Z 2 with scl(w) = | 

The algorithm proceeds in three steps: To a given word (j)(x, y) of length m = 2n, 
it first associates two complete digraphs on n vertices. Special flows on these two 
graphs define two polyhedral cones equipped with 1— homogeneous functions. The 
sum of these functions then has to be maximized over a subset to compute scl. 

2.2.1. Sketch of proof. The proof of Calegari's algorithm exploits the topological 
nature of scl. Let 7 be a loop in X which nicely represents (f>(x,y). Given an 
admissible map / : S — > X, we cut our surface S along the preimage 

of the gluing point into two simple components. Therewith, we decouple the left 
and the right half of our loop temporarily. The combinatorics of the boundaries of 
the two simple components give rise to a pair of flow- vectors (v a, vb) on n vertices. 
This pair carries all the scl-relevant information we can extract from /. Homological 
triviality of the left and right half of 7 imply that (va , vb ) lies in the Cartesian 
product of two polyhedral cones V(x), V(y). These cones define the crucial flow- 
polyhedra P(x), P(y), whose boundaries are the unit sets of the 1— homogeneous 
Klein- functions n x , n y . A detailed analysis finally shows that scl can be computed 
by maximizing n x + n y over a certain compact subset of V{x) x V{y). 

In the remainder of this section, we will give a precise formulation of the very 
technical terms used in this sketch. We will give entirely self-contained definitions, 
which do not depend on the sketched topological background. 

2.2.2. Flow-polyhedron P. We introduce necessary graph-theoretic terminology: 

Definition 2.4. Let G n be the complete digraph with n vertices [n] — {l,...,n}. 

Given a vertex i e [n] and a map / : [a] 2 — > R on edges, we define the inflow and 

outflow of / at i by inflow.^/) = ^ f Jt and outflow.^/) = ^ /y. Here f tJ = f(i,j) 

j j 
denotes the value of the map / on the directed edge from i to j. 

A nonnegative map / on edges is called a flow if inflow^/) = outflow^/) at all 

2 

vertices. We write W n C R> for the cone of such flows. 

We now define a connectedness-notion on MD-graphs: 

Definition 2.5. An MD- graph is connected if for all vertices i, j, there is a directed 
path from i to j. The graph is weakly connected if replacing all directed edges by 
undirected ones turns it into a connected undirected graph. 
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Fix z € M n . The following objects are the key ingredients in the definition of P: 
Definition 2.6. We introduce the weight-function h z : W n — > R N on flows as: 

n 

(M/))i = 5>f outflow^/) 

3=1 

Its vanishing will mirror homological triviality of a half of the loop on the level of 
the representing vectors. 

The cone V(z) is defined as the set of flows for which h z vanishes. The nonzero 
integral vectors in V(z) with connected support form the set D(z) of disc-vectors. 

We are now in a position to define the initially mentioned, crucially important 
flow-polyhedron P(z) examined in our paper and the function k z it determines: 

Definition 2.7. The rational, a posteriori finite sided flow-polyhedron P(z) is 
defined as P{z) = conv(D(z) + V(z)) = conv(D(z)) + V[z). 
The sail S{z) is the boundary of this polyhedron. 

The Klein- function k z is the unique 1— homogeneous function satisfying: 

• If the ray [v] passing through v &V has [v] (~l S(z) = 0, then k z (v) = 0. 

• If v is the closest point to in [v] (~l S(z), then k(v) = 1. 

Roughly speaking, k z is the 1-homogeneous function whose unit set is the sail. 
There is a more practical definition of n z established in Lemma 3.10 of [4]: 

Lemma 2.8. For v G V(z), an admissible expression is defined to be a represen- 
tation of the form v — Y] j tjdj + v' , where dj G D(z), v' G V(z) and tj > 0. Then 
Kz{ v ) — sw P(^2jtj)j where the supremum runs over all admissible expressions. 

2.2.3. Calegari's formula. If the surface S rationally bounds [7] = (f>{x,y), the 
representing pair of vectors (va,vb) must not only lie in V{x) x V(y), but also be 
paired. This property reflects that the two simple components can be glued back 
together and is defined as follows: 

Definition 2.9. For (x, y) g M n x Af„, define the set of paired vectors to bed 

Y {x,y) = {(v A ,v B ) e V{x) x V(y) | Vz,j : (v A )ij = («s)(j-i)i} 
Moreover, we define the compact set of unit-outflow vectors as: 

Yi(a;,y) = {{v Al v B ) G Y(x,y)\\/i : outflow^w^) = outflow^ws) = 1} 
Notice that Y™ := Yi(x,y) is independent of (x,y) G M n x M n . 

We can finally reap the benefits of our endeavors and state Calegari's formula: 
Theorem 2.10. (Calegari) If g = 4>{x,y) G Z°° * Z°° for x,y G M n x M n> then: 

scl (ff) = I ( n - , max (k x (v a ) + k v (v b )) ) 

l \ {v A ;VB)€Y 1 {x,y) J 

3. Estimates 

This section is divided into three parts: we first pursue the relative approach 
with linear-algebraic means and then give upper and lower bounds which allow 
us to determine the generic value of scl. In the final section, we prove that the 
determination of scl is hard in a precise sense. 

^We adopt the convention that 1 — 1 = n here 
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3.1. Linear Algebra of Exponents. The first, easily proven bound relates the 
scl of certain words of same reduced length. 

Lemma 3.1. (Inequality) Let v = 4>(r,s) and w = 4>(t,u) e Z°° * Z°° have equal 
reduced length m = In, and assume that there are inclusions of Z-modules: 

<{r»}> C ({*«}) and ({*«}) C <{u»}) 

Then, we have the inequality 

sc\(v) < scl(w) 

Proof. Observe that for z £ M n and / a flow, the function h z vanishes on / if 
and only if the vector (outflow^/))., is perpendicular to all vectors {z^'}. This 
immediately gives the inclusion V(r) x V(s) 3 V(t) x V(u). With Lemma T2.81 we 
then see that K r + k s > Kt + n u , wherever defined. Since the sets over which we are 
maximising are both equal to Yf 1 , Calegari's formula gives the result. □ 

The lemma shows that the map scl o<f> : M n x M n — > K factors through spaces, 
i.e. that x ({s w }) = x implies sch>(r,s)) = scl(0(i,u)) 

Example 3.2. This property can help us in concrete cases, e.g. we can see that: 

scl^a^ bai a 2 ba\a 2 ba~{ a 2 b~ ) = scl^a.^" a^bai a2 a 3^ ba\a 2 a 3 bd^ a 2 a 3 b ) 

since (5, -3, 2, -4) = (2, -2, 3, -3) - (-3, 1, 1, 1). 

The last fact can be used to prove the Compactness Lemma, which touches the 
"inverse problem" . Assume mi is a word with short reduced length which uses a 
large number of generators of the free abelian factors. Then our next lemma shows 
that we actually only need a small number of generators to realise the scl of w: 

Lemma 3.3. (Compactness) If w = (f>(x,y) e Z°° * Z°° has reduced length 2N , its 
scl is already contained in the image scl(Z r * Z s ) for all r, s > N . 

Proof. Since Z N is a free module of rank N over a principal ideal domain, the two 
submodules generated by {x^} and {y^} respectively are free of rank at most N. 
Therefore, we can choose generating sets {n, rpf} and {s%, sjv}- 

Consider wq = <f>((rx, rjv, 0, ...), (si, sjv, 0, ...)). Since the map scl ocf) factors 
through spaces, we have scl(iu) = scl(iuo). We complete the proof by noticing that 
for all n, m > N, the word wo lies in the image of the obvious scl —preserving 
inclusion IT * Z s -> Z°° * Z°°. □ 

3.2. Generic Value of scl. In order to determine the generic behaviour of scl, we 
will need to bound it from above and below. 

The first theorem of this section provides a lower bound on scl in terms of a 
subset-sum type problem determined by the exponents of our word. The immediate 
relation between the outer form of the word and the type of the bound makes it 
particularly powerful, as can be seen in the remainder of this section. 

Theorem 3.4. (Lower Bound) Let w = <p(x, y) £ Z°° *Z°° have reduced length In. 
Fix p, q € N, and assume that the following two implications hold: 
If <E N™\{0} is a vector with J2 3 ^j x ^ = f or a ^ h then J2j > P- 
If G N™\{0} is a vector with J2j HjUj = f or a ^ h then J2j Mj > 1- 
In this case, we have the inequality scl(i«) > ^(1 ~ \ ~ \) 
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Proof. The main idea behind this proof is to use the given implications to show 
that the disc- vectors have to contain a "large amount of flow" . 

By compactness, there is a vector {va,vb) G Yi{x,y) maximising n x + k v . Let 
va = Ylk tkdk + v' be any admissible expression with tk > 0, dk g D{x), v' £ V(x). 

Claim. There is an inequality k x (va) < ~ 

Proof of claim. The proof proceeds in three steps: 

Step (1): By definition, the function h x vanishes on disc-vectors. This means 

that (h x (dk))i — J2j x j outflowj(dfc) = for all i, k. Using the first implication of 
the theorem, we conclude that J^j outflow^ (dfc) > p for all k. 

Step (2): Since (va,Vb) lies in the set Y\{x,y), we have outflowjfv^) = 1 
for all vertices j. Using our admissible expression from above, we conclude that 
J2 k tk outflowj(dfc) < 1 for all j. 

Step (3): We swap the order of summation: 

p t k < tk outflow j {dk ) = tk outflow^ {dk ) < n 

k k j j k 

By Lemma we then have k x {va) = sup(^ fc tk), where the supremum runs over 
all admissible expressions. This implies the claim. □ 

Similarly, we prove k v {vb) < — • The result follows by Calegari's formula 12.101 

□ 

A major open problem in the theory of scl is to prove the conjecture that sc^i 7 ^) 
contains every rational number q > 1. By Lemma 3.18 in [5], if s,t € sc^i^), then 
also s + t + | G F 2 . Writing q = | + [q — 1) + |, we see that the above conjecture 
is equivalent to: 

Conjecture 3.5. (Interval) The image scl(i<2) contains every rational number in 
the interval [1,2]. 

The next corollary of the Lower Bound Theorem gives an indication where to 
look for these scl- values: Either they come from short words or from words which 
walk along the same subloop twice in opposite directions (like in figure [3]) . 

Corollary 3.6. Let w — 4>{x, y) be a word of reduced length m = 2n with sc\{w) G 
[1,2]. Then either m < 24 or there are distinct indices j%, j in {l,...,n} such that 
we have one of the two following identities of vectors: 

Proof. Suppose that the second possible conclusion does not hold. Since all vectors 
(Xj)i, {y^)i are nonzero, we can apply the Lower Bound Theorem 13.41 with the 
values p = q = 3 to obtain 2 > scl(u>) > ^ □ 

After having bounded scl from below, we will now proceed to give an upper 
bound purely in terms of the reduced wordlength. This bound is sharp in the limit 
and will be the second key ingredient in the determination of the generic behaviour 
of scl in Theorem 13.91 
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Theorem 3.7. (Upper Bound) Write C(m) for the supremum of the scl of words 
in Z°° * Z°° of reduced length m = In > 4. Then this supremum is attained and 
satisfies 

f f-1, if n odd 

C(m) < < n (n-l)l-l f 

77 7 — L. n , it n even 

I 2 n(n— 2)! — 2 > J 

Moreover, given e > 0, we have for m sufficiently large: ^ — 1 < C(m) < ^ — 1 + e 

Proof. For a given reduced length 2n, we will intersect all cones V(x) to obtain a 
word w n whose scl is maximal for this length. More precisely, define z £ M n by: 

z« = (l -1 ... 0), ... = (1 ... -l),z(") =z<" +1 > = ... =0 

Then the universally bounding word is given by w n = (f>(z, z). 

For all x,y £ M„, we have: ({*»}), <{y«}> C <{zW}) = {/ e Z n |Ei/j = 0}- 
Therefore, the Inequality Lemma |3~T1 gives that scl(w) < scl(w n ) for all words w of 
reduced length 2n. 

We will now bound scl(u; n ). Notice that V{z) = (^| V{x) is exactly the set 

x£M n 

of flows / on G n for which outflow^/) is equal at all vertices i. In particular, all 
Hamiltonian cycles on {1, ...,n} are disc-vectors. We split cases: 

n odd: Let va be the flow corresponding to the cycle 1, 3, 5, 2, 4, 6, 1. We 
are then forced by the pairing condition to define vb to be the flow corresponding 
to 1, n, n — 1, 2, 1. Clearly, we have (va, vb) 6 Y", and since both vectors va, vb 
are disc-vectors, we also conclude that k z (va), k z (vb) > 1- Calegari's formula ?2. 101 
and the Lower Bound Theorem 13.41 then imply: 



71 

sc\(w n ) = - - 1 

n even: Let v be the vector obtained by adding the n possible rotations of: 
(n- 2)1-1, 

(rc- 2)1-1 
0( n _2)! 




(n-2)! 
(n-2)! (n-2)! 

Figure 4: Vector in R™ 2 ; the vertices {1, n} are arranged clockwise. The number 
attached to a vertex gives the value on the edge going into this vertex. 

Then v is a flow in V(z) with outflow, (1;) = n(n — 2)! — 2. Notice that v 
contains the flow which is obtained by summing up all possible Hamiltonian cycles 
except for l,2,...,n, 1. Since there are (n — 1)! — 1 of them, Lemma [2.81 implies 
that k z (v) > (n — 1)! — 1. We observe that (v,v) 6 Y(z,z) is a paired vector, so 
( v a, v b) '■— ( n ( n -2)\-2 ' n(n-2)\-2 ^ e Y\(z,z) lies in the set we need to maximise 
over. We now use Calegari's formula [2.10[ the 1— homogeneity of k z and the Lower 
Bound Theorem 13.41 to conclude that j — 1 < scl(w„) < § — n™n-2)\~-2 ' 

We finally observe that C(n) = scl(it> n ), which completes the proof. □ 

10 



ON THE COMPLEXITY OF SAILS 



Remark 3.8. For n — 4, we have scl(u;4) = |j = 2 — ^l^-i ( com P u tcd with the 
implementation of Calegari's algorithm by A. Walker [7]) and hence the estimate 
for even n is sharp in at least one case. A precise computation of C(n) = sc\(w n ) 
for general even n remains an open problem. 

Finally, we now have all the necessary tools at our disposal to prove the main 
theorem of this section and determine the generic behaviour of scl. Recall the 
definition 1 1.1 1 of generic properties. 

Theorem 3.9. (Generic Words) Given any e > 0, we can choose N such that for 
all m > N , words w of reduced length m generically satisfy 



scl(w) e 



m m 

1, 1 + e 

4 '4 



Proof. Let e > 0. By the Upper Bound Theorem 13.71 we can pick N such that 
for all 2n > N, words w of length 2n have scl(iu) < l| — 1 + e. Fix n and set 
V = {z 6 Z n |J^j Zj = 0}. For each of the finitely many vectors A € N" with 
< V Xj < n, define the space W\ by: 



W x = { z e V 



£V 3 =of =ker(^ I ■ £ 



Since the rows of this matrix are linearly independent, the submodule W\ has rank 
at most Ti — 2. 

Let w — <ft(x, y) be any word of reduced length 2n. If not all and not 
all j/ 1 ) lie in MWx, then the Lower Bound Theorem 13.41 helps us to prove that 

A 

scl(iy) > f — 1. The theorem follows. □ 

Remark 3.10. Calegari's algorithm also applies to free products of k > 2 free abelian 
groups, and it is natural to ask which results carry over to this more general setting. 
Let w be a word with m nontrivial loop segments in the i th group of our product. 

Again, we have k flow-polyhedra Pi,...,Pfc, where Pj is constructed from the 
exponents of letters in the i th group by the same procedure as before. The definition 
of the associated functions Ki, K n also carries over. To compute scl, we have to 
maximize their sum over a compact subset Y\ of the product of all Pj . This sum 
is obtained by first restricting to unit-outflow vectors, and then imposing a "gluing 
condition", whose exact form is more complicated than before (see [1]). 

The Lower Bound Theorem generalises: Assume we are given a word w of length 
m whose exponents in the i th free factor are J xi,.../x nj , and such that we need to 
sum at least pi of these exponents (with repetitions) to obtain zero. By the same 
estimates on k, as before, we obtain: 



sclH > — - - ^ — 



Pi 

However, the proof of the Upper Bound Theorem breaks down in this more 
general setting. Our strategy for k — 2 was to give every edge roughly the same 
weight so that the pairing condition holds independently of the precise form of the 
word. The vector obtained in this way had equal outflow at all vertices since both 
graphs had the same cardinality, so it was possible to rescale and obtain a required 
unit-outflow vector. 
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For k > 2, the cardinalities of the involved graphs are usually very different, and 
therefore this vector cannot be rescaled anymore to have unit outflow everywhere. 

Therefore the proof of the Generic Word Theorem does not generalise. We can 
still deduce from the generalised Lower Bound Theorem that words w with rij 
nontrivial loop segments in the i th group generically have scl(iu) > — §• 

3.3. Complexity of Computing scl. We will give a lower bound on the algorithm 
independent complexity of computing scl of efficiently encoded words in F2: 

Theorem 3.11. (Complexity) Unless P = NP, the scl of words (f>(x,y) G F 2 cannot 
be computed in polynomial time in the input size of the vector (x, y). 

After briefly reviewing basic complexity-theoretic notation and previous results 
on scl, the main aim of this section is to prove this Complexity Theorem with the 
techniques from the preceding section. 

Remark 3.12. An algorithm is polynomial if it runs in time polynomially bounded 
in terms of the size of its input. 

A decision (or promise) problem is said to be polynomially solvable if there is a 
polynomial algorithm that solves it. We say the problem lies in the class P. 

A decision (or promise) problem lies in the complexity class NP if a solution to 
the problem can be checked in polynomial time in the input size. The problem is 
said to be N P-completc if it lies in N P and if no other problem in this class is harder 
in a precise sense. 

The input size of a computational task is the number of bits required to encode 
the input binarily. For example, a vector (xi, x n ) of natural numbers requires 
roughly E^log^) bits. 

Remark 3.13. There is a simple measure of length in the free group F 2 = Z*Z other 
than the reduced length from 12.11 the classical length of a word is the number of 
letters in a shortest representation. This notion does not naturally extend to groups 
U * Z s for r > 1 or s > 1 as it depends on a choice of bases for the factors. The 
alternative scl-algorithm presented in section 4.1.7 of [3] shows that scl can be 
computed in polynomial time in the classical length. More precisely, if we encode 
words as binary strings with pairs of entries representing generators / their inverses, 
then we can compute scl in polynomial time in the size of this (very large) input. 

However, it is artificial and inefficient to waive the use of exponents and use unary 
coding, so for example to write aaaaababa~ 1 a~ 1 a~ 1 a~ 1 a~ 1 a~ 1 b~ 1 b~ 1 for the word 
a 5 baba~ 6 b~ 2 . But "using exponents" is just the colloquial term for encoding our 
words via the map tfi, adapted to F 2 . We will therefore consider the problem of 
computing scl((/)(x, y j) for x, y G (Z\{0}) n as input. Our proof will show that unless 
P = NP, there is no algorithm which solves this problem in polynomial time in the 
size of the vector (x,y). Notice that this is strictly stronger than just saying that 
unless P = NP, scl cannot be computed in polynomial time in the reduced length. 

3.3.1. Complexity-theoretic Notation. The following classical problem is known to 
be NP— complete and will be the starting point of our complexity-theoretic analysis: 

Problem 3.14. (SUBSET SUM) Given (7*1, ■»,r n ) G Z™. Is there a nonzero vector 
(Ai, A„) G {0, 1}" with Vj = 0? 

This problem can be modified in several different ways, and we will now give 
names to the variations we need. 
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We first restrict ourselves to cases where the whole input is known to sum up to 
zero and ask for proper nonempty subsets whose sum is zero: 

Problem 3.15. (SUBSET SUM') Given (r u ...,r n ) G Z n with £\r^ = 0. Is there 
a vector (Ai, A n ) G {0, 1}™ with < JT < n an d J2j ^j r j = 0? 

We vary this problem and allow the repeated use of individual r^'s, but keep the 
total number of employed r,-'s bounded: 

Problem 3.16. (VAR SUBSET SUM') Given (n,...,r n ) G Z n urctft ^ = °- /s 
there a vector (Ai, A„) € N"\{0} wz£/i < JT < 71 an ^ J2j ^j r j = 0? 

We combine two problems and obtain the following promise problem: 

Problem 3.17. (MIXED SUBSET SUM') Given (n,...,r„) G Z" mt/i = 

such that SUBSET SUM' holds iff VAR SUBSET SUM' does. Are they satisfied? 

Finally, we define the decision problem which will serve as the key gadget in the 
proof of the Complexity Theorem 13.111 

Problem 3.18. (SMALL SCL) Given a vector x G (Z\{0})" with J^jXj = 0. 
Define y = (1, .., 1, — (n — 1)) G M n . Is it true that sc\((j)(x, y)) < § — 1 ? 

3.3.2. Proof of Complexity Theorem. 

Proof. Our aim is to polynomially reduce SUBSET SUM to SMALL SCL. This 
proves the Complexity Theorem: if we could compute scl for words encoded with 
<fi in polynomial time in the input size of the vectors, then it would be possible to 
answer SMALL SCL and thus also SUBSET SUM in polynomial time. This would 
then imply P = NP. 

Our reduction passes through MIXED SUBSET SUM': A combinatorial argument 
due to F. Manners reduces SUBSET SUM to MIXED SUBSET SUM'. A proof is 
attached in the Appendix. 

Hence we are left with reducing MIXED SUBSET SUM' to SMALL SCL. This re- 
duction relies on the Lower Bound Theorem [331 and the following following relation 
between scl and the SUBSET SUM' problem: 

Lemma 3.19. Let x G (Z\{0})" be a vector with Y], Xj = 0. Assume there is a 
nonempty set J c {l,...,n} of size M < ^ with 'Y^j<^j x j ~ an d moreover that 
neither J nor J c are of the /ormQ {fc, k + 1}. Then we have: 

sella o...a oa o ) S 2 l (n - M — 1)! 2 

Proof. Set y = (1, 1, — (n — 1)). We will first give a unit-outflow vector va £ V( x ) 
with k x (va) > 2, and then find db 6 V(y) such that (va-, vb) is paired and k v (vb) 
is large enough. 

A pair of flow- vectors (c, d) on G n , the complete digraph on vertices {l,...,n}, 
shall be called J— pair if c represents a Hamiltonian cycle on J and d represents 
one such cycle on J c . Observe that we can pick N — (n — M — 1)! such J-pairs 
(ci, di ),..., (czv, <1n) so that their sum v — J2j( c j +4/) nas nonzero flow on all edges 
inside J and all edges inside J c . The assumptions of this theorem tell us that the 



In this Lemma, we work with addition modulo n, picking representatives in {1, ...,n\. 
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vector v £ V(x) decomposes into at least 2N disc-vectors since the relevant sums 
vanish. Using Lemma |2.8[ we obtain that vector va — has k x (va) > 2. Define 

(vB)ij = (v A )j(i + l) 

One checks easily that vb is a flow with unit outflow everywhere. Therefore, we 
know that vb 6 V(y) and (va,vb) € Yi(x,y). 

We are left with proving that k v (vb) > jj- It is enough to show that vb 
has connected support, since then Nvb is a disc- vector and we therefore have 
k v (Nvb) > 1. A digraph is called weakly connected if replacing all directed edges 
by undirected ones turns it into a connected graph. By Proposition 14.91 proven 
below, it is enough to show that the support of the flow vb is weakly connected. 

If I = {p, .., (p + s)} C J and (p — 1), (p + s + 1) ^ J, we say / is an interval 
in J. We have an obvious analogous definition for intervals in J c . Then {1, ...,n} 
decomposes into intervals I\, I m , where Ij is in J for j odd and in J c for j even. 

Our goal is to show that given an interval Ik = {p, + s} in J, all points in 
1^ lie in the same weakly connected component as p — 1. Split cases: 

Case (1): s = 0. In this case, {vb) p ( p -i) — ( v a)( p -i)(p+i) > and so p and 
p — 1 are weakly connected. 

Case (2): s > 1. By the form of J, we can pick g ^ p, (p + 1) with q g J. One 
checks that the edges indicated below have positive flow for Ug: 




Figure 5: Flow- vector in M™ ; the vertices {1, ...,n} are arranged clockwise 

Exactly the same argument holds for intervals in J c . Combining these two claims, 
we see that Ik and Ik-i lie in the same weakly connected component for all k. We 
go once around the circle to conclude that the weak support of vb is connected. □ 

We are now ready to reduce MIXED SUBSET SUM' to SMALL SCL: 

Lemma 3.20. // we can solve SMALL SCL in polynomial time, then we can also 
solve MIXED SUBSET SUM' in polynomial time. 

Proof. Given a problem instance (n, r n ) E Z n , we can check in polynomial time 
if there is a j with = or rj + =0. 
If this is not the case, we compute 

s = scl(a ri &...a r "- 1 6a r "b- ( ™- 1) ) 

If s < % — 1, then by the Lower Bound Theorem 13. 4[ we can deduce that 
the problem VAR SUBSET SUM'(n, r„) is true, hence so is MIXED SUBSET 
SUM'(n,...,r„). 
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If s > §, then we use Lemma [3.191 to conclude that SUBSET SUM'(rr, r„) 
and hence MIXED SUBSET SUM'(n, r n ) are both false. □ 

This concludes the proof of the Complexity Theorem 13.111 □ 



4. POLYHEDRA 

4.1. Non-Classifiability Theorem. Whereas the extremal rays of the scl— poly- 
hedra P(z) = conv(D(z) + V(z)) have been classified in a satisfactory manner (see 
Figure [2]), such a description could not be found for their extremal points. Our 
main theorem, whose proof will occupy most of this section, gives a reason for this: 

Theorem 4.1. (Non-classifiability) Let G be a connected MD-graph with M ver- 
tices and E > edges. Then there is an (alternating) word w G Z * Z of length 
m = 4(M + 3E^ ME+1 ) and an extremal point f of a flow-polyhedron associated to 
w such that G is the abstract graph underlying f . 

We start this section with a brief treatment of extremal points, and then present 
a more general version of the Non-Classifiability Theorem, deducing the specific 
case from there. In the third part, we then prove the generalised theorem. 

4.1.1. Extremal Points of Polyhedra. The next definition is central: 

Definition 4.2. Let S C W 1 be a subset. A vector x G S is called an extremal 
point of S if x cannot be written as a nontrivial convex combination of other vectors 

n 

in S, i.e. if d = AjWj for A, € K>0: /J Aj = 1, Vj G S implies Vj = d for all j. 

*=1 3 

Let E C Z™ be a set of integral vectors and P = conv(E) the polyhedron defined 
by its convex hull. The following is a very useful criterion for deciding whether a 
given point in E is extremal: 

Lemma 4.3. (Extremality Criterion) A vector d G P is an extremal point if and 
only if d G E and: 

For all N € N and d\, d^ G E : ^Nd = d± + .... + djy implies d\ = ... = d^ = d^j 

Proof. Clearly, an extremal point d must lie in E, else it would need to be a 
nontrivial convex combination of vectors in E. If (Nd = d\ + .... + d/v) is an 
expression as above, dividing by N gives a convex combination. We conclude di — d 
for all i. 

Conversely, assume d G E is a vector for which the above implication holds. An 
easy computation shows that d is extremal if and only if it cannot be written as a 
nontrivial convex combination of vectors in E. Suppose d — ^jdj is a convex 
combination with Xj > and dj G E for all j. 

If dj = d for all i, we are done. 

If not, we can bring all Xjdj with dj = d to the left hand side, rescale, and hence 
obtain a representation of d as a convex combination of vectors that are all different 
from d. Therefore, we may assume dj ^ d for all j. As the rational vector d lies in 
the real convex span of the rational points d±, dpj G Z", we know by Lemma 4 
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in [2] that d also lies in their rational convex span. Thus we can find pLj = ^- with 
Pj G No, qj 6N such that d = X] Mj^r Then, for A" = g^, we have: 



Nd=^Pi ( n* i * 

This is a sum of I J^J qj \ = N — — N vectors in E, so by the implication 

in the theorem, we have for all i with ^ that d — di. This is a contradiction. □ 

4.1.2. Generalised Non-Classifiability Theorem. We need new definitions to state 
the general version of the theorem. Recall Definition 11.21 of MD-graphs and of the 
function abst. We can associate polyhedra to edge-weights on graphs: 

Definition 4.4. Let G be an MD-graph with edge-weights w. Consider the set 
E w of nonzero (nonnegative) integral flows / for which ^ e f(e)w{e) = 0. Define 
polyhedron associated to w by Q w — coiw(E w ). 

As an interpretation, we can think of G as a country with cities and connecting 
streets, where transferring a food-unit over route e costs w(e) money-units. Then 
E w are the nonzero integral food-flows for which our selfless state does not earn or 
loose money with its road toll/subsidy system. 

We extend this definition to vertex-weights: 

Definition 4.5. Let v be a vertex-weight on G. Define an edge-weight w v by giving 
an edge from a to 6 the weight v(a). Set E v = E Wv and Q v = Q Wv . 

In our above motivation, this corresponds to the state adjusting its tolls to how 
desirable it is to leave certain cities. Notice that for x G (Z\{0})" a vertex-weight 
on the complete digraph G n , the set E v contains precisely the nonzero integral 
vectors in the set V(x) defined in Definition 12.61 

Recall that an MD-graph is reflexive if its weakly connected components agree 
with its connected components. In this situation, we can prove: 

Theorem 4.6. (Generalised Non-Classifiability) Let G be a reflexive abstract MD- 
graph with M vertices and E > edges. Let x G Z" be a vertex-weight on G n 
containing at least M + %E 3AIE+1 entries equal to 1 and at least M + 3E 3ME+1 
entries equal to —I. Then, there is an extremal point f of the polyhedron Q x whose 
abstract graph is G. 

We deduce the specific Non-Classifiability Theorem from this general case: 

Proof of \4- 1\ from \4-6\ Let G be a connected, so in particular reflexive, MD-graph 
with M vertices and E > edges. Consider the word w = (f>(x,x), where 

/ \ 

x= 1, 1, —1, — 1 

\n times n times / 

for A^ = M + 3E 3ME+1 . By the general version of the theorem, the polyhedron Q x 
contains an extremal point / for Q x whose underlying abstract graph is G. 
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Claim. With the notation introduced in Definition \2.b\ we have 

P{x) = com{D{x) + V{x)) C Q x 

Proof of claim. Given d+v £ D(x) + V(x), we can use Lemma 4.11 in [4] to express 
v as v = XjVj as a nonnegative linear combination of integral representatives vi 

j 

of the extremal rays of V. Then for some natural N > Aj, we have: 
d + ^^^^id + Nv^+fl-^^-Yid + O) 

This implies that d + v £ Q x - The claim follows as Q x is convex. □ 

Hence since / is an extremal point in Q x and lies in P(x), it follows immediately 
that d is also an extremal point for P(x) □ 

Remark 4.7. Theorem 14.11 is sharp in the sense that only connected graphs can 
occur as abstract graphs underlying extremal points of P{x). Indeed, all extremal 
points are disc-vectors as if v = d + e is extremal, with d £ D(x) and e £ V{x), 
then, by considering the expression v = \d-\- i(d + 2e), we can see that e = 0. 

We need a central definition before we can start the proof of the generalised 
theorem. We have seen in 11.21 that to every MD-graph G, we can associate an 
equivalent abstract graph, denoted by abst(G). The next definition describes how 
this construction extends to flows and weights. 

Definition 4.8. Given a flow g on a graph G with support S. The abstract graph 
abst(g) := abst(S) is naturally equipped with an induced flow /. 

If moreover G is equipped with an edge weight u : E{G) — > R, then we construct 
the induced weight w on abst(/) as follows: We can pass from a (finite) graph to its 
abstraction in finitely many steps by successively joining pairs of edges. Whenever 
we merge two edges e\, we give the new arising edge e i2 the weight u{e\) +u{e2j- 
This yields a well-defined weight w on the graph abst(/). 

Write abst(/, w) for the graph abst(/) with induced flow and edge-weight. 

4.2. Proof of Generalised Non-Classifiability Theorem. 

Proof. This rather long proof involves multiple steps. Before we fill out the details, 
we will give a rough sketch of how we turn a graph into an extremal point underlied 
by this graph. The deep reason which allows us to get such a strong control over 
extremal points via the Extremality Criterion 14.31 is that N positive integers sum- 
ming up to N all have to be equal to 1. Let G be a reflexive abstract MD-graph 
with M vertices and E > edges, together with a vertex- weight x £ Z™ as in the 
theorem. We proceed in three steps: 

Step (1): We find an integral flow / on the graph G, nonzero on all edges, such 
that there exists an edge e (drawn with a dotted line) with flow- value 1. Moreover, 
/ satisfies f(e') < E M on all edges e'. 

Step (2): We define an integral edge- weight w on G, which is negative on e 
and positive on all other edges. The number-theoretic properties of w are chosen 
to allow an application of the Extremality Criterion 14.31 More precisely, we will 
show that the flow / is an extremal point of the polyhedron Q w . 

Step (3): We implement the edge- weighted flowed graph (G, f, w): that means 
we find an integral flow g £ V(x) on G n such that supp(g) = G, the induced flow 
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of g is /, and the edge- weight on G induced from the weight w x on G n agrees with 
the weight w from Step (2). It then follows easily that g is the required extremal 
point of Q x . 

The following picture describes this construction in a simple example (notice 
that not all edges are drawn in the 4 th part.) 




Figure 6: Finding extremal point with given abstract graph. Here w\ = 10 6 + 1, 
w 2 = 10 6 + 10,w 3 = 10 6 + 10 2 ,w 4 = 10 6 + 10 3 ,W7 5 = 10 6 + 10 4 ,w 6 = -9,032,211 

We now provide the details: 

Step (1): To find the required flow, we need two lemmata. The first one charac- 
terises which graphs can appear as supports of flows. 

Proposition 4.9. Let G be an MD-graph with M vertices and E edges. Then G 
admits a flow f which is positive on all edges if and only if it is reflexive. Moreover, 
such a flow can be chosen to satisfy /(e) < E M . 

Proof. Let / be such a flow and assume that H is a weakly connected component of 
G. Write for the sum of all flows through edges from i to j. By finiteness, there 
is a connected component C in H without ingoing edges. But such a component 
would also have no outgoing edges by the following calculation: 

i£Cj£G iGCj&C ieC ]GG\C i£C 3 eG\C 

Hence, C is equal to H. 

If, conversely, G is a reflexive MD-graph on n vertices, we can consider the set 
S of all possible cycles on G (no repeated vertices; we allow cycles which use only 
one edge to go from a vertex back to itself). This set certainly contains at most 
E M elements. We obtain a flow / by adding all of these individual cycles in S. 
It is nonzero on all edges since we can complete every directed edge to a cycle by 
rcflcxivity. □ 

The next graph-theoretic lemma is the key tool in Step (1), since it will allow us 
to find the distinguished edge e. 

Lemma 4.10. Let G be a connected abstract MD-graph with at least one edge. 
Then there is an edge e such that G\{e} is still connected. 

Proof. We prove the claim by induction. 
If \G\ = 1,2, the statement holds trivially. 

If \G\ > 2, we have indegree(i>)+outdegree(v) > 3 for all vertices v in G since G is 
abstract. We call this the degree-condition. Choose a cycle C of length k > 2 in G. 
If there is a vertex in C that is joined to any vertex in C apart from its succeeding 
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one, then we can remove an edge without disconnecting the graph. Thus we may 
also assume that the internal edges of C are exactly the k edges forming the cycle, 
and hence in particular that C ^ G. Define G' to be the graph obtained from G 
by contracting C to a single vertex v. Then 1 < \G'\ < \G\. At every vertex of G' 
apart from v, the degree-condition holds automatically. There is certainly one in- 
and one outgoing vertex at v by connectedness. 

If this is all, then k = 2 and the in- /outgoing edges of the cycle must be attached 
to distinct vertices by the degree condition: 



o^cQp— o 

Figure 7: Special case with circle C of length k = 2 



In this case, remove the indicated edge going in the opposite direction without 
disconnecting the graph. 

If, on the other hand, the degree-condition holds at v, our smaller contracted 
graph 67' is connected and abstract, and we can remove an edge e without discon- 
necting G' by induction. Now remove the corresponding edge from G. Since every 
path in G' lifts to a path in G, we conclude that also G\{e} is connected. □ 

We can now finish the first step of the proof: let G be a reflexive abstract MD- 
graph with M vertices and E > edges. Pick a connected component C of G 
with at least one edge and remove some edge e from C without disconnecting it by 
Lemma T4. 101 Then G\{e} is still reflexive, so by Lemma T4. 91 we can find a flow /' 
on this graph with < f'(e') < (E — 1) for all edges e'. Pick a cycle through e 
and add the corresponding flow to /' to obtain the flow / required for Step (1). 

Step (2). The next number-theoretic lemma gives a uniqueness result for the scalar 
product of integral vectors and will facilitate the definition of the edge-weight w: 

Lemma 4.11. Given a vector f = (/i, //.) G Nq of nonnegative integers, we can 
find (wi, ...,Wk) S Nq such that: 

(k k \ 

3=1 i=j ) 

and w z < 2(J2j fj + l) fc+1 for all i. 

k 

Proof. Set M = ^ fj. Let n = (M + l) k+1 and define Wj by w 3 ■ = n + (M + l)^ 1 

3=1 

for j — 1, k. The result follows by distinguishing the cases J2j A? smaller than, 
larger than, or equal to M, and using uniqueness of the (M+l)— adic representation 
in the third case. □ 

Recall the flow f on G constructed in Step (1). Label all edges other than e 
by ei, e-E-x- We now apply Lemma 14.111 to the vector (f(ei),...,f(eE-i)) to 
obtain an edge- weight w defined on all edges except for e. Give e = the weight 

E-l 

w(e E ) =-J2 f(ej)w(ej). We have a bound \w(ej)\ < 2E ( - M+1 ^ E+ V for all j. 

3=1 
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Claim. The flow f is an extremal point in Q w = conv(E w ). 

Proof. The crucial fact underlying this trick is that if N positive integers sum up 
to N, they must all be equal to 1. 

Assume that / £ E w . Let Nf = A + •■• + /« be a decomposition for the flow 
Nf with fi, /jv £ E w and N £ N. Every /j is a nonzero integral flow in E w 
and therefore must have positive flow through e to balance out the negative weight 
coming from flow through other edges. The numbers /i(e), /jv(e) are all positive 
integers and sum up to N — Nf(e). Hence /i(e) = 1 for all i. 

Since fi £ E w , this implies that the following difference vanishes: 

E-l E-l 

/*( e J'M e i) - f( e j) w ( e j) = 

3=1 3 = 1 

for all i, and we conclude fi = f for all i by the choice of w in 14. f II □ 

Step (3). We will describe hereafter how we can find a flow g £ V(x) such that the 
graph abst(<?), equipped with the flow induced by g and the edge- weight inherited 
from w Xl is equal to the flowed weighted abstract graph (G, /, w) constructed above. 
In a second step, we will deduce from Step (2) that g is extremal. 

Concretising Abstract Graphs. Recall our given vertex- weight x £ Z™. Label the 
vertices in G„ with a;— weight +1 by Li,...,L p (the "left vertices") and the ones 
with weight —1 by Ri,...,R q (the "right vertices"). Assume that G has vertices 
Vi, Vm and edges e\, .., es-x, &e = e. 

Construct a flow g £ V(x) in a step-by-step process as follows: The vertex Vi in 
G n will correspond to Li in G for i — 1,2, ...,M, so all vertex-representing nodes 
of G n lie on the left. Having implemented the edges ei, .., ej_i with the flow-vector 
assume goes from V p to V g with w ei — s. Pick \s\ vertices li,...,h s \ on 
the left and |s| vertices ri, ■■■,r\ g \ vertices on the right of G n which do not lie in 
supp(/ij_i). We always have enough vertices available since p,q > M + 3E 3ME+1 > 

M + Ef=i(Kei)| + l)- 

Define a simple path P in G n as follows: 

If s > 0, consider P — pr\l s .. .l\q. 

If s = 0, we take P = pr\q. 

If s < 0, the path we use is P = prir2...r| s | +1 g. 
Obtain the weight hi from by adding flow /(e^) to the edges of the path P. 
One checks easily that the resulting vector g = hs satisfies 

&bst(g,w x ) = {G,f,w) 

We are now finally in a position to finish off this proof: Let g = AjPj be a 
convex representation of g with (fy- £ Q x , Xj > 0,y_^ Xj = 1. We can abstract to 
find flow- vectors /j on G which induce gj as in Definition 14.81 Since the weight 
w is induced by w x , all abstractions fj must lie in Q w . We thus obtain a convex 
representation of the flow / in Q w . By Step (2), we know that / is extremal and 
hence all flows fj must be equal to /. 

From this, we immediately conclude that g = gj for all j, so g is extremal in Q x . 
This completes the proof of the Generalized Non-Classifiability Theorem 14.61 □ 
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4.3. Essential Decomposition. The principal aim of our efforts is to find a con- 
cise representation of the scl-polyhedra of elements z E M n . These are given as: 

P = P(z) = conv(L>(z) + V(z)) = conv(D(z)) + V(z) 

Here, V = V(z) is the understood recession cone of the polyhedron, and what 
we need to describe is the contribution of the disc- vectors D = D(z). The set of 
extremal points of P is the minimal set S C D + V with conv(5 + V) = P. As a 
natural alternative, we can consider the minimal set T C D+V with T+V = D + V, 
and we obtain the essential decomposition. An easy exercise shows that T consists 
of the following vectors: 

Definition 4.12. A vector d E D is an essential disc-vector if it cannot be written 
as a nontrivial sum in D + V, i.e. if d = e + v, e E D,v E V implies v = 0. 

It is immediate from the minimality of S that S is contained in T, i.e. that every 
extremal point is an essential disc-vector. The following example shows that not 
every essential disc-vector needs to be extremal: 




Figure 8: An essential, but not extremal disc-vector (average of two distinct disc- 
vectors) 



We have seen in the previous section that the set S of extremal points cannot be 
classified by the topology of the flows. Since S C T, this result extends to T. There 
is a further complication of computational nature arising for essential disc-vectors: 
the essential decomposition of the scl-polyhedra is not suitable for computational 
purposes. More precisely: 

Essential Membership Theorem. The following decision problem is coNP- com- 
plete (i.e. has HP-complete complement): "Given a word w = <j>(x,y) E Fi of re- 

2 

duced length m — 2n and an integral vector d E Z™ . Is d an essential disc-vector 
for D{x) +V(x)?" 

Proof. The input of the problem can be represented as an element s = (x,d) of 
[J (Z\{0}™ _1 ) x Nq , since y is irrelevant and x n is determined by x\, ...,x n -\. 

n 

2 

First notice that checking whether a given flow d E Nq has connected support 
can be done in polynomial time (e.g. by depth- first search) . To see that the problem 
is in coNP, assume that the answer to the problem determined by (x, d) is negative, 
where x has length n. We can check in polynomial time if d E D(x), so we may 

2 

assume that this is the case. Given a counterexample e, v E N l , we can check 
in polynomial time that e E D(x), v E V(x)\{0} and d = e + v. Therefore our 
problem is in coNP. 

To show that it is coNP— complete, we will give a polynomial time reduction 
from the following coNP-complete problem: 
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Problem 4.13. (coSUBSET - SUM) Given RcZ finite, is it true that: 

VTe7>(i?)\{0}:^i^O? 

Suppose we have a list of numbers a\, ■■■,a m and want to decide the statement: 
"No nonempty subset sums up to 0." 

m 

Set a m+ i = — di and notice that there is a nonempty subset of {a\, a m } sum- 

i=l 

ming up to if and only if there is a proper nonempty subset of {ai, a m +i} with 
vanishing sum. Without loss of generality, it is enough to only consider nontrivial 
instances of coSUBSET SUM, so to assume ^ for all k. Let n = (2(m + 1)) + 2 
and define the vertex weight x = (m, 1, ai, —1, 02, — 1, a TO +i, —1) G M n . 
Consider the vector d e D(x) determined by the flow drawn below: 




Figure 9: Reduction from coSUBSET SUM to ESSENTIAL 



Now d is an essential disc- vector if and only if there is no proper nonzero con- 
nected integral subflow e with h x (e) = 0, but these flows correspond bijectively to 
proper nonempty subsets T C {a\, <x TO +i} with ^ a t = 0. Hence, deciding if d 

is essential is equivalent to deciding coSUBSET SUM. The reduction is computable 
in polynomial time. □ 

Notice that the above polynomial reduction fails if we restrict ourselves to alter- 
nating words since there, the length m — 2n of our word grows proportionally to 
the \a,i\ and hence exponentially in the input size. 

We conclude the paper with an unrelated, but pretty conjecture we spotted: 
Conjecture 4.14. Let p, q, r 6 N and n = p + q + r. Then: 

sc\(a- n b- 1 aPba%- 1 a r b) = 1 - I£^M1 
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Appendix A. (by Freddie Manners) Proof of the NP-completeness of 

MIXED SUBSET SUM' 

A.l. Definitions. We recall the definitions of the problems SUBSET SUM, SUBSET 
SUM', VAR SUBSET SUM', and MIXED SUBSET SUM'. We further define what it 
means to consider these problems over Z fc ; for example: 

Problem A.l. SUBSET SUM over Z fe : Given vi,...,v n with i>, G Z fe , does there 
exist a nonzero vector (Ai, . . . , A„) G {0, 1}™ with J2j ^j v j = 0? 

The other problems over Z fc are defined analogously. When we wish to consider 
the original problem, we may refer to it as e.g. SUBSET SUM over Z to avoid 
ambiguity. 

It is a classical fact that SUBSET SUM over Z is NP-complete. We wish to 
investigate the complexity of MIXED SUBSET SUM' over Z. 

A.2. Proofs. We first note that SUBSET SUM and SUBSET SUM' are trivially 
equivalent. 

Lemma 1. Given vi,...,v n G Z summing to zero, (SUBSET SUM') holds iff 
(SUBSET SUM) holds onvi,..., v n -i- 

Proof. Remark that the complement of a solution to SUBSET SUM' (i.e. setting 
A- = 1 — Aj) is also a solution. □ 

We now prove a crucial lemma that allows us to consider solving simultaneous 
subset sum problems; equivalently, it allows us to work over Z fe rather than Z. 

Lemma 2. Each of the above subset sum problems (i.e. SUBSET SUM', VAR 
SUBSET SUM' or MIXED SUBSET SUM'j over Z fe has a polynomial reduction to 
the same problem over Z. 

Proof. Pick integers 1 = iVi < . . . -C N k , and set y, L = Y!j=i N j vi /' '■ Tncn if tnc 
Nj increase sufficiently fast, the problems for (yj) and (vi) are equivalent. (Note 
the Nj need not increase so rapidly that their lengths in bits are super-polynomial 
in the other inputs.) □ 

We can now present our main construction, which aims to reduce SUBSET SUM' 
(over Z) to MIXED SUBSET SUM' (over Z k for some k). Let a±, . . . ,a n G Z be an 
input to SUBSET SUM'. Consider the tabic: 





Oil 


a 2 








02 




Pn 


p 


Q 


1 


Ol 


0,2 




a n 



















2 


-1 


-1 




-1 


-1 


-1 




-1 


n 


n 


3 


-1 










-1 










1 


1 


4 





-1 










-1 







1 


1 


n + 2 










-1 










-1 


1 


1 


n + 3 


-1 


-1 




-1 













r 


n — r 



Here, the columns beyond the first — labelled by o^, P, Q — represent elements 
of Z fe for k = n + 3, and the rows are simultaneous subset sum problems to be 
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satisfied. Note that every row sums to zero; that is, (o^, ft, P, Q) are a valid 
input to VAR SUBSET SUM' or SUBSET SUM'. Finally, r is some integer in the 
range < r < n. 

We now suppose that a solution A = (A Qj , \» t , Xp, Xq) to VAR SUBSET SUM' 
exists. We show: 

Lemma 3. 

(i) Ap + Xq = I; 

(ii) X ai + Xp i — 1 for all 1 <i < n; 

(iii) J2i X az = r or n - r; 

(iv) A constitutes a solution to SUBSET SUM' on this table; 

(v) The (Xai) constitute a solution to SUBSET SUM' on a\, . . . ,«„. 

Proof. 

(i) This follows from considering row 2: if Ap + Xq = then J2j = 0, an d if 
Xp + Xq > 2 then J2j > 2n + 2, which are both forbidden. 

(ii) This follows from (i) and considering row i + 2. 

(iii) This follows from (i) and considering row n + 3. 

(iv) This follows from (i) and (ii). 

(v) That 53i^ai a i = follows from considering row 1. Then, recalling that 
< r < n, (ii) and (iii) give the other constraints. 

□ 

As a direct consequence of part (iv) of this lemma, we see that the table is a 
valid input to MIXED SUBSET SUM'. We now show a converse to part (v): 

Lemma 4. If fii, . . . , fj,k is a solution to SUBSET SUM' on a\, . . . ,a n , then for 
some choice of r, we can construct a solution to SUBSET SUM' on the table. 

Proof. This is straightforward: take r = ^2 i t L i 1 X ai — fa, Xp i = 1 — m, Xp = 1, 
and Aq = 0. □ 

We can now state and prove our main result: 

Theorem 5. SUBSET SUM' overTL has a polynomial reduction to MIXED SUBSET 
SUM' over Z k (for k = n + S). 

Proof. By Lemmas [3] and 01 a\,...,a n has a solution to SUBSET SUM' iff the 
table has a solution to MIXED SUBSET SUM' for some value of r (0 < r < n). So, 
running an oracle for MIXED SUBSET SUM' at most n — 2 times gives a solution 
to the S U BS ET S U M ' problem. □ 

Corollary 6. The problem MIXED SUBSET SUM' over Z is NP-complete. 

Proof. We use Lemma [TJ theorem [S] and Lemma [5] to give reductions from SUBSET 
SUM over Z, to SUBSET SUM' over Z, to MIXED SUBSET SUM' over Z fe , to MIXED 
SUBSET SUM' over Z in that order. 

(That MIXED SUBSET SUM' is in NP is clear.) □ 
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